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One philosophical issue underlying instructional 
evaluation prises from tension between the concepts of academic 
freedom and academic responsibility. Academic freedom can be cited as 
an argument against evaluating teaching in circumstances where 
evaluation genuinely encumbers the pursuit or di ssemination of 
knowledge. Another source of tension centers around the notion that 
instructor's credentials, position, and expertise preclude 
evaluation. Some feel that' until we know mpre about evaluation, we 
should do nothing. Others feel that failure to evaluate may lead to 
sins more grievous than would evaluation with uncertainty. In the 
face of increasing demands placed upon faculty to furnish more and 
more evidence supporting the amount, quality, and value of what they 
do, many faculty members want to do away with intrusive evaluation 
policies, while students and administrators push for objective data ■ 
to facilitate decisions and guarantee educational quality. A major 
psychometric issue in instructional evaluation is whether evaluation 
should focus on instructional outcomes or on instructional processes. 
The problems of evaluating teaching by measuring student learning are 
both philosophical and methodological. On the othfcr hand, evaluating 
teaching by measuring course and instructor characteristics and 
behavior that are indicative of good teaching is no less complex. A 
reasonable goal would be to devise an evaluation plan that takes 
advantage of the strengths of process measures, but that also guards 
against their weaknesses. (LH) 
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EVALUATING TEACHING 



Perhaps .the most important philosophical issue underlying instructional evaluation anses from a certain 
tension between the concepts of academic freedom and academic responsibility. t 

Academic freedom is sometimes cited as an argument against evaluating teaching, apparently on the 
.rounds that evaluation often suggests change and change that is not entirely spontaneous is not free Howev- 
S a suggestion to change is no? necessarily a requirement to change, and to extend the principle of academic 
freedom from the substance to the manner of teaching strains the traditional concept. Moreover, to use the 
principle of academic freedom as an unqualified reason for not evaluating teaching or, worse, as an excuse for 
Poor teaching, risks doing violence to other people's rights under the principle of academic responsibility. 
P There may be circumstances, however, under which the academic freedom argument against evaluating 
teaching is valid: when evaluation genuinely encumbers the pursuit or disseminatiorf^of knowledge. Thus ; the 
argument could be valid if, say, anxiety about a peer review or a student survey we* to .nhibu an instructor 
from teaching some important course material. Although the inhibition may be more the result of the 
Sector s aLety than of the evaluation itself, it is possible for evaluation to be used as a weapon g insMn 
sectors who teach, for example, 'politically sensitive courses. In bona fide cases of this sort h J^tuhon 
needs to assure that the academic freedom of neither teacher, nor students, nor colleagues is violated and that 
^akV^Sioisibaite of all are fulfilled. This is perhaps best achieved in a deliberate evaluation pro- 
cesser which the instructor, review committee members, and administrators can take into account any v.ola- 

d0nS ^Tcon7pwSophical issue is, do the instructor's credentials, position, and expertise preclude evaluation? 
One occas onally encounters the argument that, especially in the sanctity of the classroom there is no authonty 
bevond the instructor, and hence no evaluation of the instructor that could be appropnate. For the mo * part 
bowrve,- this argument fails to persuade. On the contrary one couW argue with greater force that teaching 
must be evaluated, if only to confirm or disconfirm these claims of pnvilege. 

A final philosophical issue reflects academe's, inherent cautiousness and perhaps perfectionism or fea of er- 
ror: Must national evaluation await a perfect evaluation technology? Teaching ,s still ™perfecLy und - 
stood and the instruments of evaluation are in some ways primitive. Faculty, then, who are often trained cntacs, 

ometimes direct their considerable critical powers at th.se deficiencies and propose that ^,"2^ 
wTshould do nothing." But evaluation can proceed if teaching is suffiaently well understood and evaluahon 
techno°ogy sufficiently well advanced that teaching can be responsibly albeit imperfectly assessed. Indeed 

a lu^e W evaCaS ma^ lead to sins more grievous than would evaluation with uncertainty. Instructional and 
personnel decisions are" going to be made in" any event, and these decisions are likely to be better the more they 
are enlightened by responsible evaluation procedures. 

PSyC M^ychX'amic issues in instructional evaluation reduce to the notion of control. Faculty .value = doing 
what thev 'want to do in the way they want to do it,. In recent years, however, faculty have been pressed more 
and more to change their work styles, often by taking on tasks they find distasteful or even threatening and gw- 
E? up pursuits they hold dear." Moreover, faculty are being required to furnish more and more evidence 5 sup- 
pcfrZ the amount" and quality, and even the value, of what they do. At the same time hey _&nd ^ reward 
Siminishing-salaries stagnate, grant sources disappear, time for reflection is scarce, *f^V2$%££5 
rarelv expressed. Lurking beneath these external pressures are internal stresses as wel Faculty wonder it, 
hTough al o 'this evaluation, their worsted deepest fears will finally be realized: they wul be found out 

i^Wsiators and students are experiencing much the same turmoil. For administrators, every 'budge 
cut bring? painful decisions; every move is criticized; every decision challenged even 

me cost of education continues to rise, the rewards are in serious doubt, and there is little they can do about it 
In short, faculty, administrators, and students often feel they are losing control of their uves x 

One effSct of this feeling of loss of control is a drive to put things in order. For faculty this might mean a 
pressing desife to return to a secure and simpler collegiality without the intrusiveness of evaluahon policies and j 
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procedures and without such painful responsibilities as the rejection of junior colleagues seeking tenure. For ad- 
ministrators the drive for order might manifest itself in a thirst for objective data to facilitate decisions and re- 
move the burden of subjectivity. For students it might mean a need for confidence in the quality of the educa- 
tion that is offered them and a passion for procedures and mechanisms to guarantee that quality. 

The problem is that these drives compete, and efforts to put one group's life in order may further unsettle 
anotherVand. there is no ready solution to this conflict of needs. Refining evaluation instruments is desirable, 
but overall a simplistic response to deeply held anxieties. Clarifying policies and requirements is useful but also 
misses the mark. Providing all the avenues for grievance in the world is still at best a superficial response. 
Perhaps the most one can do is try to recognize the other people's needs, show a little empathy, improve what 
can be improved, and press on. 

Psychometric-Issues a 1 

A major psychometric issue in instructional evaluation is, should the evaluation focus on instructional out- 
comes^ on instructional processes. Instructional outcomes are the results of teaching, the diverse aspects of 
student learning. Process variables are the course and instructor characteristics and instructor behaviors that 
lead to student learning. The purpose of teaching is to facilitate learning, and the goal of instructional evaluation 
is to identify who or what leads to learning. The issue is whether one can approach this goal directly, by 
measuring learning, or must approach it indirectly, by measuring course and instructor characteristics. 

The problems of evaluating teaching by measuring student learning are both philosophical and methodo- 
logical. Philosophically, it seems unfair to evaluate a person in terms of what other people do or fail to do, at 
least at the beginning of the course, and student motivation throughout the course. r If the test results were to be 
used to compare one instructor with another, the testing procedure would need to ensure that student differ- 
ences m ability, motivation, and prior achievement were controlled for^by random assignment of students to sec- 
tions (which is sometimes inconvenient, usually impossible) or by application of statistical adjustments (which 
do not work so well in practice as they do in theory, iee Glass, 1974). Second, the tests would need to be ident- 
ical or equivalent in content and difficulty in all courses or sections, so that one instructor did not seem a less ef- 
fective teacher than another simply because the latter gave an easier test. Third, there would need to be some 
way for instructors to assure themselves (and their colleagues) that the tests did indeed measure the depth and 
breadth of what their students were supposed to learn, so that people would not be misled into belie nng that a 
teacher is effective merely because the students successfully memorized a collection of facts. - Finally, triers 
would need to be some central mechanism to make sure that all the necessary controls were in place and co re- 
ceive and act on appeals. , ,\ , 
On the other hand, there is a massive literature that shows that good achievement tests can be de.ivea, 
and faculty in general do seem to consider even routine classroom tests good enough for use in important deci- 
sions about students. Moreover, it is clear that instructors do have some responsibility for whatever learning oc- 
curs in their .course. To be reasonable, then, the evaluation plan should take advantage of the strengths of class- 
room tests without losing sigRt of their weaknesses. . 

Evaluating teaching by measuring course and instructor characteristics and behaviors that are indicative or 
good teaching is certainly no less complex. Some seventy years of intense research effort have failed to produce 
a widely accepted, empirically supported theory or definition of good teaching, Recent research has made 
matters even more difficult by pointing out the importance of attending to the differential effects of teaching 
methods on different kinds of students. As unacceptable as it would be to reward or punish instructors on the 
basis of the dubious results of classroom tests, it would be equally unacceptable to reward or punish them for 
their conformity to a list of qualities that may have nothing to do with promoting learning in their courses. 

But the negative case has again been overstated. Instructional research has made significant progress to- 
ward identifying the important features of good teaching, and certain evaluation questionnaire items have been 
shown to be valid. As is the case with classroom testing, a reasonable goal would be to devise an evaluation 
plan that takes advantage of the strengths of process measures, but that also guards against their weaknesses. 

Kenneth O. Doyle, Jr. 
Research Associate 
Measurement Services Center 
University of Minnesota 

Excerpted from Evaluating Teaching. (Lexington, Massachusetts: Lexington Books, D. C. Heath and Company, c 
1983, D. C. Heath and Company.) 
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